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ABSTRACT 

The  Comprehensive  Nuclear-Test-Ban  Treaty  (CTBT),  a  global  ban  on  nuclear  explosions,  is  currently  pre-entry 
into  force.  Under  the  CTBT,  a  monitoring  system  of  seismic,  hydroacoustic,  infrasonic  and  radionuclide  sensors 
operates  and  data  from  this  system  is  analyzed  by  the  International  Data  Centre  (IDC).  The  IDC  provides  CTBT 
signatories  basic  seismic  event  parameters  and  a  screening  analysis  indicating  whether  an  event  exhibits  explosion 
characteristics  (for  example,  shallow  depth).  An  important  component  of  the  screening  analysis  is  a  statistical  test  of 
the  the  null  hypothesis  HO:  Explosion  Characteristics  using  empirical  measurements  of  seismic  energy  (magnitudes). 
Relative  to  mb,  earthquakes  generally  have  a  larger  Ms  magnitude  than  explosions.  This  paper  proposes  a 
hypothesis  test  (screening  analysis)  using  Ms  and  mb  that  expressly  accounts  for  physical  correction  model 
inadequacy  in  the  standard  error  of  the  test  statistic.  With  this  hypothesis  test  formulation,  the  2009  Democratic 
People's  Republic  of  Korea  (DPRK)  announced  nuclear  weapon  test  fails  to  reject  the  null  hypothesis  HO:  Explosion 
Characteristics. 
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OBJECTIVES 

Network  averaging  cannot  overcome  bias  in  a  magnitude  correction  model.  A  new  formulation  of  the  CTBT 
Ms  |  mb  seismic  event  screening  hypothesis  test  is  developed  and  demonstrated  in  this  paper  (Ms  |  mb  denotes  the 
conditional  probability  model  Ms  given  mb).  The  formulation  properly  partitions  total  error  into  Model  Error  and 
Station  Noise,  and  this  partition  provides  for  the  correct  reduction  of  the  standard  error  of  the  hypothesis  test.  With 
the  correct  hypothesis  test  formulation,  a  decision  of  fail  to  reject  the  null  hypothesis  of  Explosion  Characteristics 
(denoted  HO  or  HO:  Explosion  Characteristics)  is  made  with  a  network  of  27  global  International  Seismological 
Centre  (ISC)  seismic  stations  for  the  2009  DPRK  nuclear  weapon  test.  Without  the  proper  error  partition  in  the 
standard  error,  HO  is  rejected  and  the  2009  DPRK  nuclear  weapon  test  is  screened  out. 

RESEARCH  ACCOMPLISHED 

The  CTBT,  in  basic  obligation,  is  simple  and  direct.  Article  I  reads: 

•  Each  State  Party  undertakes  not  to  carry  out  any  nuclear  weapon  test  explosion  or  any  other  nuclear 
explosion,  and  to  prohibit  and  prevent  any  such  nuclear  explosion  at  any  place  under  its  jurisdiction  or 
control. 

•  Each  State  Party  undertakes,  furthermore,  to  refrain  from  causing,  encouraging,  or  in  any  way  participating 
in  the  carrying  out  of  any  nuclear  weapon  test  explosion  or  any  other  nuclear  explosion. 

Protocol  for  the  CTBT  calls  for  the  implementation  of  a  monitoring  system  and  associated  IDC  with  the 
responsibility  to  “receive,  collect,  process,  analyse,  report  on  and  archive  data  from  monitoring  system  facilities.” 
Protocol  also  directs  the  IDC  to  provide  an  event  screening  service  to  the  treaty  signatories,  the  technical  details  of 
which  are  specified  in  the  CTBT  operational  manual. 

Seismic  Event  Screening  for  the  International  Data  Centre 

Seismic  energy  is  generated  by  earthquakes,  volcanoes,  mining  and  oil  exploration  explosions,  natural  fracturing  of 
large  rocks,  large  above-ground  explosions  and  nuclear  weapon  tests.  A  seismic  waveform  is  a  measured  transient 
time  series  of  this  energy  with  distinct  segments  (phases).  Surface  wave  energy  relative  to  the  energy  in  the  initial 
P-phase,  is  the  basis  for  an  IDC  event  screening  hypothesis  test  (Ms  |  mb).  The  path  and  distance  between  event  and 
stations  is  different  and  if  the  phase  energy  measurements  from  each  station  could  be  accurately  corrected  for  path 
effects,  the  measurements  would  represent  energy  at  the  source.  Station  magnitudes  are  averaged  to  estimate  an 
event  (network)  magnitude.  Many  surface  wave  corrections  and  scales  have  been  developed  during  the  past  century. 
Most  notable  are  scales  with  distance  corrections  (Gutenberg  [1945],  Vanek  et  al.  [1962],  von  Seggem  [1977], 

Herak  and  Herak  [1993],  Rezapour  and  Pearce  [1998],  Stevens  and  McLaughlin  [2001],  Bormann  et  al.  [2009]),  and 
scales  with  corrections  for  filter  effect,  distance  and  path  (Marshall  and  Basham  [1972],  Russell  [2006]).  With 
calibration  analysis,  the  hypothesis  formulation  developed  in  this  paper  is  applicable  to  all  commonly  used  Ms 
calculations.  In  general,  the  model  for  computed  station  Ms  is 

Ms  =  loglO(Amplitude)  +  Path  +  Distance  +  Filter  Effect.  (1) 

The  Rayleigh  wave  magnitude  Ms  is  further  modeled  as  a  function  of  mb  giving  the  conditional  station  magnitude 
Ms  |  mb.  The  IDC  provides  CTBT  signatories  basic  seismic  event  parameters  including  event  location  and  depth, 
measures  of  event  size  (magnitudes),  and  a  screening  analysis  for  events  with  mb  greater  than  3.5. 

The  physical  basis  of  the  Ms  |  mb  discriminant  is  quite  mature  (see  Douglas  et  al.  [1971]  and  Stevens  and  Day 
[1985]),  and  is  based  on  the  physics  that  for  a  given  mb,  a  shallow  earthquake  excites  relatively  more  surface-wave 
energy  than  a  single-point  explosion.  However,  deep  earthquakes  like  explosions  have  small  Ms  for  their  mb.  This 
means  that  single-point  underground  explosions  and  deep  earthquakes  will  usually  fail  to  reject  the  null  hypothesis 
of  explosion  characteristics  in  the  Ms  |  mb  screening  analysis.  Erroneously  screening  out  a  nuclear  explosion  in 
IDC  analysis  is  clearly  serious  and  the  ramifications  of  such  an  error  could  be  political,  financial,  environmental  -  or 
most  serious,  the  loss  of  life  though  military  provocation.  IDC  seismic  event  screening  includes  a  statistical  test  of 
the  null  hypothesis  that  a  seismic  event  has  explosion  characteristics,  with  a  very  small  probability  of  incorrectly 
rejecting  this  hypothesis.  This  conservative  IDC  event  screening  analysis  retains  events  of  no  concern  as  well  as 
explosions  in  a  “fail  to  reject”  bulletin  provided  to  CTBT  signatories.  A  technical  review  of  general  seismic 
monitoring  is  provided  in  Anderson  et  al.  (2010). 
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Statistical  Model 

The  probability  model  of  Ms  corrected  for  mb  (Ms  |  mb)  is 

Y  =  Ms  -  r|(mb)  =  |i  +  Model  Error  +  Station  Noise  (2) 

where  r](mb)  is  a  model  of  the  magnitude  near  the  source  and  Station  Noise  represents  measurement  and  ambient 
noise  with  zero  mean.  Note  that  Equation  (2)  is  a  regression  model  of  Ms  versus  mb  embedded  into  a  simple  random 
effects  analysis  of  variance  model.  Model  Error  is  a  zero  mean  random  effect  that  varies  from  event  to  event  and 
represents  correction  model  inadequacy  from  local  effects  such  as  inaccurate  depth  and  local  material  properties, 
fdter/path/distance  corrections,  and  magnitude  corrections.  These  effects  are  in  fact  physical  and  deterministic,  yet 
realistically  unknown.  The  technical  approach  in  Equation  (2)  is  to  model  these  effects  as  random  and  properly 
include  the  Model  Error  variance  component  in  calculations  of  the  standard  error  for  the  hypothesis  test.  The 
variance  component  for  Model  Error  decreases  with  improved  corrections  and  physical  theory,  and  the  term  for 
Station  Noise  in  the  standard  error  is  reduced  through  station  averaging.  Importantly,  station  averaging  cannot 
reduce  Model  Error.  For  IDC  screening  analysis  a  simple  linear  regression  formulation,  q(mb)  =  P  x  mb,  is  used  for 
both  the  null  and  alternate  hypothesis.  The  variance  component  for  Model  Error  is  equal  across  both  the  null  and 
alternate  hypotheses  as  is  the  variance  component  for  Station  Noise.  The  differences  in  the  models  for  the  two 
hypotheses  is  represented  through  differences  in  the  null  and  alternate  population  means  p.  The  the  hypothesis  test 
constructed  from  Equation  2  is  composed  of  a  more  sophisticated  and  correct  standard  error  that  includes  the 
variance  components  for  Model  Error  and  Station  Noise. 

Demonstration  Analysis 

The  model  component  q(mb)  =  P  x  mb  is  a  physical  correction  and  is  assumed  known  (for  IDC  event  screening  P  is 
known).  Selby  et  al.  (2011)  determine  that  P  =  1  adequately  represents  the  general  physical  scaling  relationship 
between  Ms  versus  mb,  so  that  corrected-for-magnitude  network  surface  energy  is  Ms  -  mb.  The  null  and  alternate 
hypothesis  population  means,  and  variance  components  for  Model  Error  and  Station  Noise,  are  assumed  known 
through  calibration  analysis  demonstrated  in  the  following  sections,  and  the  calibrated  screening  hypothesis  test  is 
applied  to  the  2009  DPRK  nuclear  weapon  test  (NWT).  The  2009  DPRK  NWT  was  not  included  in  the  calibration 
analysis. 

The  Ms  given  mb  discriminant  is  demonstrated  with  seismic  event  data  acquired  from  the  ISC  and  the  AWE 
Blacknest  Seismological  Centre  (BSC).  The  events  acquired  from  the  ISC  and  BSC  spanned  1964  to  2000,  and 
included  USSR,  Chinese,  and  French  underground  nuclear  explosion  tests  -  59  explosions  for  the  HO  population; 
and  earthquakes  and  mining  activity  -  129  events  for  the  HA  population.  The  ISC  and  BSC  event  catalogues  provide 
the  mb  and  the  individual  station  Ms  values  for  each  event. 

Bootstrap  calibration  analysis  assumes  that  the  calibration  data  set  is  a  representative  sample  although  possibly  not 
large  enough  to  adequately  represent  extremes  necessary  to  confidently  estimate  the  variance  components. 
Calibration  analysis  is  accomplished  with  bootstrap  sampling  of  corrected  Ms  values  (Ms  -  P  x  mb),  and  the 
associated  event  and  station  indices.  The  calibration  data  included  59  +  129  =  188  calibration  events  each  observed 
by  varying  numbers  and  locations  of  stations.  For  example,  one  event  had  56  stations  observing,  and  another  had 
50  stations  observing.  Also,  12  events  had  3  stations  observing  and  50  events  had  two  stations  observing  (at  least 
two  stations  were  required  for  an  Ms  calculation  in  the  analysis).  All  events  with  stations  observing  gave  1906  total 
event/station  records,  and  so  each  bootstrap  sample  had  1906  randomly  selected  records.  A  total  of  5000  bootstrap 
samples  were  taken  in  the  calibration  analysis.  For  each  bootstrap  sample,  the  null  and  alternate  means  |i(l  and  jiA 
were  computed.  Also,  for  each  bootstrap  sample,  the  variance  components  for  the  one-way  random  effects  model 
were  computed.  This  collection  of  5000  bootstrap  variance  components  provides  a  technically  reasonable  approach 
to  inflate  the  calibration  variance  components  to  more  conservative  values  (e.g.,  the  95th  quartile). 

Screening  Analysis  on  the  2009  DPRK  Nuclear  Weapon  Test 

The  2006  and  2009  DPRK  announced  NWTs  had  large  network  Ms  relative  to  mb  (see  Bonner  et  al.  [2008]  and 
Patton  and  Taylor  [2008]  for  research  on  the  2006  DPRK  NWT).  This  resulted  in  reconsidering  event  screening 
processes  at  the  IDC  which  led  to  the  recommendation  by  Selby  et  al.  (20 1 1)  of  P  =  1 .  The  development  in  this 
paper  builds  on  that  finding.  The  ISC/BSC  data  used  in  this  analysis  did  not  have  Ms  measurements  for  the  2006 
DPRK  NWT.  For  the  2009  DPRK  NWT,  twenty  seven  Ms  measurements  were  reported  in  the  ISC  bulletin  and  the 
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event  mb  magnitude  was  4.62.  Application  of  the  proposed  hypothesis  test  formation  in  this  paper  gave  a  “fail  to 
reject  HO”  p  -  value  =  0.15.  The  same  analysis  with  the  hypothesis  formulation  with  Model  Error  rejects  HO  with  a 
p  -  value  approximately  equal  to  zero  -  the  wrong  decision. 

CONCLUSIONS  AND  RECOMMENDATIONS 


Network  averaging  cannot  reduce  bias  in  an  magnitude  correction  model.  A  new  formulation  of  CTBT  seismic  event 
screening  hypothesis  test  is  developed  and  demonstrated  in  this  paper.  The  formulation  properly  partitions  total  error 
into  Model  Error  and  Station  Noise,  and  this  partition  provides  for  the  correct  reduction  of  the  standard  error  of  the 
hypothesis  test.  With  the  correct  hypothesis  test  formulation,  a  decision  of  “fail  to  reject  HO:  Explosion 
Characteristics”  is  made  with  a  network  of  27  global  ISC  seismic  stations  for  the  2009  DPRK  nuclear  weapon  test. 
Without  the  proper  error  partition  in  the  standard  error,  HO  is  rejected  and  the  2009  DPRK  nuclear  weapon  test  is 
screened  out. 
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